Entropy-Based Dynamic Rescoring with Language Model in E2E ASR Systems
Authors
Abstract
Language models (LMs) have played crucial roles in automatic speech recognition (ASR), whether as an essential part of a conventional ASR system composed of an acoustic model and an LM, or integrated to enhance the performance of novel end-to-end (E2E) ASR systems. With the development of machine learning and deep learning, language modeling has made great progress in natural language processing applications. In recent years, efforts have been made to leverage the advantages of LMs in ASR. The most common way to apply this integration is still shallow fusion, because it can be implemented easily with zero overhead while obtaining significant improvement. Our method further extends the applicability of shallow fusion without hyperparameter tuning while maintaining similar performance.
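As a rough illustration of the shallow fusion mentioned in the abstract, the sketch below combines per-token log-probabilities from an ASR model and an LM as `asr + lm_weight * lm`, first with a fixed, hand-tuned weight and then with a weight derived from the entropy of the two distributions. The function names and the specific entropy rule in `entropy_based_weight` are assumptions made for illustration only; the abstract does not give the paper's actual formula, so this should not be read as the authors' method.

```python
import math


def log_softmax(logits):
    """Convert raw scores to log-probabilities in a numerically stable way."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


def entropy(log_probs):
    """Shannon entropy (in nats) of a distribution given as log-probabilities."""
    total = 0.0
    for lp in log_probs:
        p = math.exp(lp)
        if p > 0.0:  # skip zero-probability entries to avoid 0 * -inf
            total -= p * lp
    return total


def shallow_fusion(asr_log_probs, lm_log_probs, lm_weight):
    """Standard shallow fusion: add the weighted LM score to the ASR score."""
    return [a + lm_weight * l for a, l in zip(asr_log_probs, lm_log_probs)]


def entropy_based_weight(asr_log_probs, lm_log_probs):
    """Hypothetical dynamic weight (an assumption, not the paper's formula):
    lean on the LM more when the ASR model is uncertain relative to the LM."""
    h_asr = entropy(asr_log_probs)
    h_lm = entropy(lm_log_probs)
    total = h_asr + h_lm
    return h_asr / total if total > 0.0 else 0.0


def argmax(scores):
    """Index of the highest-scoring token."""
    return max(range(len(scores)), key=lambda i: scores[i])


# Toy three-token vocabulary: the ASR model is torn between tokens 0 and 1,
# while the LM clearly prefers token 1.
asr = log_softmax([2.0, 1.9, 0.1])
lm = log_softmax([0.0, 3.0, 0.0])

fixed = shallow_fusion(asr, lm, lm_weight=0.3)          # weight tuned by hand
dynamic_weight = entropy_based_weight(asr, lm)          # weight computed per step
dynamic = shallow_fusion(asr, lm, lm_weight=dynamic_weight)
```

With a fixed weight, `lm_weight` is a hyperparameter that must be tuned on held-out data; the dynamic variant replaces it with a value computed at each decoding step, which is the general idea the title and abstract point to.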
Similar resources
An Efficient Approach of Language Model Applying in ASR Systems
Language model plays a pivotal role in large vocabulary speech recognition systems. Providing more syntactic and semantic information, high-level language models hold stronger ability in guiding the search process and hence optimizing the final result. But on the other hand, complex language models, compared with simple ones, usually introduce proportional computing workload that jeopardizes th...
ASR-based systems for language learning and therapy
ASR-based CALL seems to offer many possibilities for language learning and therapy. However, in both domains the speech of the users generally differs substantially from standard speech. ASR of such atypical speech is complex and challenging. Furthermore, developing successful CALL systems requires a mix of expertise. This combination of factors has led to misconceptions and pessimism on the us...
Attacking Paper-Based E2E Voting Systems
In this paper, we develop methods for constructing votebuying/coercion attacks on end-to-end voting systems, and describe votebuying/coercion attacks on three proposed end-to-end voting systems: Punchscan, Prêt-à-voter, and ThreeBallot. We also demonstrate a different attack on Punchscan, which could permit corrupt election officials to change votes without detection in some cases. Additionall...
Simulation-based analysis of E2E voting systems
End-to-end auditable voting systems are expected to guarantee very interesting, and often sophisticated security properties, including correctness, privacy, fairness, receipt-freeness, . . . However, for many well-known protocols, these properties have never been analyzed in a systematic way. In this paper, we investigate the use of techniques from the simulation-based security tradition for th...
Fuzzy class rescoring: a part-of-speech language model
Current speech recognition systems usually use word-based trigram language models. More elaborate models are applied to word lattices or N best lists in a rescoring pass following the acoustic decoding process. In this paper we consider techniques for dealing with class-based language models in the lattice rescoring framework of our JANUS large vocabulary speech recognizer. We demonstrate how t...
Journal
Journal title: Applied Sciences
Year: 2022
ISSN: 2076-3417
DOI: https://doi.org/10.3390/app12199690